Quick audio retrieval using active search
Authors
Abstract
This paper describes a method for quickly searching broadcast audio data to detect and locate known sounds using reference templates. The method is based on the active search algorithm and histogram modeling of zero-crossing features. Compared to exhaustive search, active search reduces the number of candidate matches between the reference and test templates by a factor of up to 36 while remaining optimal. Computation is reduced further because zero-crossing features are inexpensive to compute. The method is robust against digitization noise and against added white noise down to a signal-to-noise ratio of 20 dB.
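The pruning idea behind active search can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the abstract gives no formulas, so the histogram-intersection score, the integer threshold `min_overlap`, and the helper names `zero_crossings`, `histogram`, `overlap`, and `search` are all hypothetical choices made here. The sketch also assumes the per-frame features have already been quantized to bin indices. Because sliding the window by one frame can change the intersection count by at most 1, a window that falls short of the threshold by `d` lets the search jump `d` frames ahead without missing a match.

```python
def zero_crossings(frame):
    """Count sign changes between consecutive samples in one frame."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))


def histogram(features, n_bins):
    """Count how often each quantized feature value (0..n_bins-1) occurs."""
    counts = [0] * n_bins
    for f in features:
        counts[f] += 1
    return counts


def overlap(h_a, h_b):
    """Histogram intersection in raw counts (not normalized)."""
    return sum(min(a, b) for a, b in zip(h_a, h_b))


def search(ref, test, n_bins, min_overlap, active=True):
    """Return (match positions, number of windows evaluated).

    `min_overlap` is an assumed integer threshold on the intersection count.
    """
    n = len(ref)
    h_ref = histogram(ref, n_bins)
    hits, evaluated, pos = [], 0, 0
    while pos + n <= len(test):
        score = overlap(h_ref, histogram(test[pos:pos + n], n_bins))
        evaluated += 1
        if score >= min_overlap:
            hits.append(pos)
            pos += 1
        elif active:
            # One shift changes the overlap by at most 1, so the next
            # (min_overlap - score - 1) windows cannot reach the threshold.
            pos += min_overlap - score
        else:
            pos += 1  # exhaustive search: evaluate every window
    return hits, evaluated


# Toy data: quantized per-frame features with the reference embedded once.
ref = [0, 1, 2, 3, 0, 1, 2, 3]
test = [4] * 40 + ref + [4] * 40

active_hits, active_evals = search(ref, test, n_bins=5, min_overlap=7)
full_hits, full_evals = search(ref, test, n_bins=5, min_overlap=7, active=False)
print(active_hits == full_hits, active_evals, full_evals)
```

On this toy input both variants return the same match positions, while the active variant evaluates far fewer windows. The size of the saving depends on the data and the threshold, so the figure reported in the abstract should not be expected from this sketch.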
Similar resources
Very quick audio searching: introducing global pruning to the Time-Series Active Search
Previously, we proposed a histogram-based quick signal search method called Time-Series Active Search (TAS). TAS is a method of searching through long audio or video recordings for a specified segment, based on signal similarity. TAS is fast; it can search through a 24-hour recording in 1 second after a query-independent preprocessing. However, an even faster method is required when we consider...
A quick search method for audio and video signals based on histogram pruning
This paper proposes a quick method of similarity-based signal searching to detect and locate a specific audio or video signal, given as a query, in a long stored audio or video signal. With existing techniques, similarity-based searching may become impractical in terms of computing time when searching through long-running signals (several days' worth). The proposed algorithm, which is...
Scalable Metadata and Quick Retrieval of Audio Signals
Audio search algorithms have reached a degree of speed and accuracy that allows them to search efficiently within large databases of audio. For speed, algorithms generally depend on precalculated indexing metadata. Unfortunately, the size of the metadata follows the same exponential trend as the audio data itself, and this may lead to an exponential increase in storage cost and search time. The...
Speechfind: an experimental on-line spoken document retrieval system for historical audio archives
In this study, we present the SpeechFind system, an experimental on-line spoken document retrieval system for historical audio archives. As part of an on-going U.S. NSF Digital Library Initiative project, entitled the National Gallery of the Spoken Word (NGSW), SpeechFind is intended to serve as an audio index and search engine for spoken word collections spanning the 20th century with as much ...
A Framework to Provide Fine-Grained Time-Dependent Context for Active Listening Experiences
[1] Joren Six and Marc Leman. Panako: A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification. In Proceedings of the 15th ISMIR Conference (ISMIR 2014). [2] Joren Six, Olmo Cornelis, and Marc Leman. TarsosDSP, a Real-Time Audio Processing Framework in Java. In Proceedings of the 53rd AES Conference (AES53rd), 2014. [3] Avery L. Wang. An Industrial-Strength Audio Search ...